# 



AMENDMENTS TO THE CLAIMS 



(Currently Amended) A videoconfe 



irenci 



stations if said conferee i3 determined to be 



6. (Previously Added) The 
determining whether said conferee is s 



encing system comprising: 



a conference bridge for interconnect^ ng a plurality of remotely located videoconferencing 
stations; and 

means for determining whether a cohferee is speaking by analyzing based , at least in part, 
on whether visual lip movements of said conferee are reasonably consistent with an audio signal J 
from a conference station in which said cor feree is located so as to produce human speech ; and 

means for visually altering an imago of said conferee displayed in other conference 



speaking . 



videoconfejrence system of claim 5 wherein said means for 
peaking comprises a voice activity detector. 



7. (Previously Added) The videoconfenence system of claim 6 wherein said voice activity 
detector is implemented at each of said conference stations. 

8. (Previously Added) The videoconference system of claim 6 wherein said voice activity 
detector is implemented at each of said conference bridge. 

9. (Previously Added) The videoconfererice system of claim 6 wherein said voice activity 
detector includes image analysis and recognition software. 
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1 0. (Currently Amended) The videoconference system of claim & 29 wherein said means for 
visually altering said image comprises means for highlighting a border around said image of said 
conferee determined to be speaking. 

1 1 . (Previously Amended) A videoconijbrence station comprising: 

a transmitter to transmit a combined audio video signal to a videoconference bridge; and 
means for determining whether a conferee located at said videoconference station is 

speaking by analyzing whether visual lip i novements of said conferee are substantially consistent 

with an audio signal at said station so as 1 ;> indicate human speech. 

12. (Previously Added) The videocor ference station of claim 1 1 wherein said means for 
determining whether said conferee is speaking is a voice activity detector. 



13. (Previously Added) The videoconference 
detector includes image analysis and rec 



system of claim 12 wherein said voice activity 
ignition software. 



14. (Previously Amended) A method| of displaying images of a plurality of conferees in a 
videoconference system, comprising: 

determining whether a conferee ii speaking by analyzing a consistency between visual lip 

signal from a conference station in which said conferee 
is located such that the combination of lip ( movement and audio signal indicates human speech; 
and 



movements of said conferee and an audio 
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visually altering an image of said conferee that is displayed to other conferees when said 
conferee is determined to be speaking. 

15. (Previously Amended) A method of determining whether a conferee in a videoconference 
is speaking, comprising analyzing whether idsual lip movements of said conferee are reasonably 
consistent with an audio signal from a conf srence station in which said conferee is located such 
that the combination of lip movement and i udio signal indicates human speech. 

16. (New) A method of determining wl ether a conferee in a videoconference is speaking 
comprising: 

detecting that the conferee's lips arfe moving based on a visual image of the conferee's 

lips; 

detecting an audio signal associatec I with the conferee; and 

determining whether the conferee i j speaking based, at least in part, on whether the 
detected audio signal is substantially consistent with the detected movement of the conferee's 
lips. 

17. (New) The method of claim 16 further comprising: 
altering an image of the conferee that is displayed to other conferees if the conferee is 

determined to be speaking. 

1 8. (New) The method of claim 1 6 furtlier comprising: 
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1 



providing textual information to' identi 
determined to be speaking. 



the conferee to other conferees if the conferee is 



19. (New) A method for identifying whicq conferee in a video conference is speaking 
comprising: 

detecting a first audio signal associated with a first conferee; 
detecting a second audio signal assoc ated with a second conferee; 
comparing the first detected audio signal with the second detected audio signal to 

I 

determine which detected audio signal is louder; and 

providing an indication to the first cpnferee and the second conferee of which detected 
audio signal is louder. 



20. (New) The method of claim 19, wherein providing an indication to the first conferee and 



the second conferee of which detected audio signal is louder comprises: 

altering an image of either the first conferee or the second conferee based, at least in part, 
on which detected audio signal is determined to be louder. 



21 . (New) The method of claim 20, wherein altering the image of either the first conferee or 
the second conferee based, at least in part, oi which detected audio signal is determined to be 
louder comprises: 



highlighting a border around the image of either the first conferee or the second conferee 

i 

based, at least in part, on which received audio signal is determined to be louder. 
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22. (New) The method of claim 1 9, whereip 
the second conferee of which detected audio si j 



providing textual information to the fiijst 
which detected audio signal is determined to 



providing an indication to the first conferee and 
gnal is louder comprises: 

conferee and the second conferee indicating 
e louder. 



23. (New) A videoconferencing system comprising: 

a conference bridge for interconnecting a plurality of remotely located videoconferencing 
stations; and 

a voice activity detector to determinfe whether a conferee is speaking based, at least in 
part, on whether lip movements of the conferee are substantially consistent with an audio signal 
from a conference station in which said conferee is located. 



24. (New) The videoconference system of claim 23, wherein the voice activity detector is 
implemented at each of the plurality of vi stations. 



25. (New) The videoconference systajh of claim 23, wherein the voice activity detector is 
implemented at the conference bridge. 



26. (New) A videoconference station comprising: 

a transmitter to transmit a combined audio/video signal to a videoconference bridge; and 
a voice activity detector to analyze whether visual lip movements of a conferee are 

substantially consistent with an audio signal at the videoconference station so as to indicate 

human speech. 
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27. (New) The videoconference station of claim 26, wherein the voice activity detector 
includes image analysis and recognition software. 



28. (New) The videoconference station o 

a display unit to provide a visual 
videoconference. 



claim 26, further comprising: 
representation of conferees participating in a 



29. (New) The videoconferencing system i 
means for visually altering an image 



of claim 5, further comprising: 
of said conferee displayed in other conference 



stations if said conferee is determined to be 



peaking. 
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